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We consider a variant of so called power-law random graph. A sequence of expected 
degrees corresponds to a power-law degree distribution with finite mean and infinite 
variance. In previous works the asymptotic picture with number of nodes limiting 
to infinity has been considered. It was found that an interesting structure appears. 
It has resemblance with such graphs like the Internet graph. Some simulations have 
shown that a finite sized variant has similar properties as well. Here we investigate 
this case in more analytical fashion, and, with help of some simple lower bounds for 
large valued expectations of relevant random variables, we can shed some light into this 
issue. A new term, 'communication range random graph' is introduced to emphasize 
that some further restrictions are needed to have a relevant random graph model for 
a reasonable sized communication network, like the Internet. In this case a pleasant 
model is obtained, giving the opportunity to understand such networks on an intuitive 
level. This would be beneficial in order to understand, say, how a particular routing 
works in such networks. 



1 Introduction 

Since the pioneering work by three Faloutsos brothers, jlj and some other groups 
like that of Barabasi's [3], around the millennium, so called power-law graphs, 
with degree sequence obeying power-law distribution with finite mean and in- 
finite variance, have attracted high interest by several authors. This degree 
sequence is argued to reflect some fundamental aspects of some communication 
networks and some other natural and technological networks. It appears that 
this concept can be turned into a mathematical object in several ways, with 



provable properties, see for instance [TH [21 [3 HI HI HH]- However, for the sake 
of tractability, asymptotic regime with growing number of nodes, N ^ oo, has 
been most popular. Some quite interesting results were obtained [IT1[T2]. For 
instance, it was found that a random graph variant of this model, where with 
a given degree sequence links are drawn as randomly as possible, produces a 
graph with some characteristic properties that has correspondence with such 
network as the Internet itself, at its autonomous systems level (AS-graph). A 
kind of 'soft-hierarchy' of large degree nodes is formed only due to combinatorial 
probabilities. It is convenient to categorize nodes with increasing degrees into 
'tiers' with nodes in certain subsequent intervals of degrees. It is sufficient to 
consider approximately log log tiers. This can be done in a way that a node 
in a tier has at least one link to upper-laying next tier, with probability tending 
to 1, asymptotically. The hierarchical part of nodes was called the core. Thus 
very short (log log A^) paths exist from the bottom to top of the core. It was 
also shown that even shorter paths are needed to find this core for almost any 
node in the same component. Thus, a log log A^ asymptotical upper bound for 
distance in the giant component was established. These results with extensions 
were obtained independently by Chung and others, [2 |7j, using some refined 
methods of random graph theory. However, we found our more elementary ap- 
proach with the concept of the core also very useful giving an intuitive insight. 
Refined variants of the theory can be found in works of van der Hofstad and 
others, showing, among other issues, that the log log A^ upper bound is the 
best possible . 

However, it was apparent from the proofs, that convergence can be very slow, 
involving such functions like 1 /log log A^ with limit 0, that is approached only 
with 'unrealistically' large values of A^, certainly unimaginable in the framework 
of communication networks. Some simulations indicated that in spite of this 
some reasonably sized graphs have properties that are similar to this asymptotic 
picture. That is why it is interesting to study this question in more details using 
an analytical approach. Here we do some first steps into this direction. 

It is also interesting, that the asymptotic model indicates interesting con- 
sequences for such graphs robustness against targeted attacks against the top 
level nodes. Such graphs show good robustness against such failures, at least in 
the terms of the distance: the remaining tiers are able to maintain connectivity 
with the price of only an insignificant number of extra hops. However, here it 
is also important to know how valid these results are for reasonable graph sizes. 
Recently, possibilities of extending the basic model by 'redirecting' the links, 
have been discussed. Here it is possible that the asymptotic picture is 'non 
convergent', meaning that it tells nothing about the finite variant of the graph. 

2 Model definition and asymptotic results 

We consider a variant of power-law random graph, similar to one proposed by 
Chung and Lu [7|, see also: [Ml [13] . A natural number, A^ > 0, is the number 
of nodes in the graph. Nodes are labeled with natural numbers 1, 2, • • • , A'. A 



node with label i, 1 < i < N has 'capacity', = {N/i)", with real number 
a, ^ < a < 1, which reflects the power-law degree sequence. For each possible 
unordered pair of nodes {i,j}, i,j £ {1, 2, • • • , N}, we associate the number of 
links between those nodes as a random variable Ei,j with Poisson distribution, 
with expected value E{Eij) = ^i^j / J2iLi ^i- AH these random variables are 
considered as independent. In shorthand we write Eij = Po{XiXj / Ln), with 
Ljv = X^i^i ^i- Thus multiple links and self-loops are allowed. However, such 
artifacts are not too harmful, since the vast majority of these variables take 
values or 1 in a large enough graph. The expected degree of node i, di, is 
thus di = E(X]^i Ei^.j) = ^i^jl Z^ili due to a basic property for 

the sum of independent Poisson distributed random variables. Thus has the 
meaning of expected degree of node i. 

Let us define the following sequence of functions: 

*(«,^^.^ a, 

P,{N) = (T-2)/3,_i(7V) + e(iV), i = l,2,-.. 

with = a, e{N) = 1{N)/ logN and is a very slowly diverging function 
as its argument grows to infinity. 
We define the 'upper layers' as 

Uo = {l}, C/, ={i:A, >7V^^W}, j = l,2,---. 

Provided that fulfills: 1{1) = 1, I (A^)/ log log log iV and 

1{N) I log log log log — > oo, we had the following result for the power-law graph 
described above: 



Theorem 2.1 Let 



fc* = A:*(7V) 



log log iV 



-log(T-2) 

where [•] , denotes the least integer greater than or equal to its argument. 
Then the hop-count distance between two randomly chosen vertices of the gi- 
ant component, which exists asymptotically almost surely (a.a.s.), is less than 
2k*{N){l + o{l)), a.a.s. 

We define the core, C, as the upper layer Uk*. Later on this proposition was 
strengthened considerably, one such result being that this upper bound is tight 
[8]. However, such more detailed analysis is very involved and that is why we 
prefer to stay on the level of simple upper bounds, also in what follows. Roughly 
speaking, the idea of our proof of Theorem l2.1l is that the probability that a node 
i in any layer Uj has a link to the upper layer Uj-i with probability tending 
to 1 as iV ^ CX3 (we write this as: P{Uj 9 j ^ Uj-i) 1). Thus, it takes at 
most k* hops to travel from the lowest layer to the top degree node. Further, 
almost any node within the giant component has a path to some node in Uk* , 
with a number of hops that is sub-linear with log log A^. Thus, the upper bound 



follows. However, as we see in the next section, the convergence of probability to 
1 can be very slow, say, P {Uj 9 i ^ Uj-i) > 1 — log log n ' c > 0, a convergence 
rate that is practically 'unobservable' in our framework. Thus it is a relevant 
question whether this asymptotic picture tells anything about a graph with only 
a reasonably large number of nodes. Some simulations seem to indicate that the 
answer is positive, see e.g. [inilllllll]. However, we found that in order to find a 
corresponding above described layer structure in a finite model one must define 
function l(-) in some particular way, not prescribed by its asymptotic behavior 
only. In this paper our aim is to explain such circumstances and indicate a way 
how such finite sized random graphs can be analysed, and thus to make such 
random graph models more usable for modeling communication networks. 



3 Analysis of 'communication range' graphs 

It is easy to see that the cardinality of layer j > is: | Uj |= |^A^i~ft(^)/"J = 
|^;V'1-(t-i)/3j(.'V)j ^ where, [-J is the largest integer smaller or equal to the argu- 
ment. Thus we have a lower bound for the sum of capacities in a layer j > 0: 

and V{Uq) — N". For we have asymptotically linear scaling with N. Indeed, 
it is easy to see that Ln > A^" [l + x^^dx > jz^{N ~ N") > j^N, where 
c can be taken arbitrarily close to 1, provided that N is large enough. For 
instance, c = 9/10 is valid provided that N > IQt^. However, notably these 
bounds are not uniform with a. For a, close to 1, we would have to choose a 
small value for c, for any reasonable N. This is a general trend here, since we 
must also fix the range of a more precisely, not just stating that 1/2 < a < 1, 
which was sufficient for the asymptotic analysis. This circumstance reflects the 
fact that the asymptotic regime is approached sensitively with respect to a. 
Our hypothesis is that for communication networks with N in reasonable range 
of thousands of nodes or tens thousands of nodes, it is necessary to have a in 
the lower half of the range (0, 1) then the asymptotic range is reasonably close. 
Luckily, in the case of the Internet, this is a range of a that has been observed. 
We call this range of N and a, the communication range. Similarly we have: 

with constants Cj, close to 1, provided N and t are in the communication range. 
Within the same range, we can easily find a lower bound for the probability that 
a node in layer j has at least one link to layer j — 1: 

PiUj{N) 3 I ^ Uj^i{N)) = 1 - cxpi-\^ViUj-i)/LN) (1) 

> l_exp(-^e'W) =p,. 



This relation also shows the delicacy of the communication range, where we can 
approximate Cjjc by 1 — otherwise we would need a number far from 1, giving 
a big effect to the lower bound po- Say, if a is close to 1, this ratio would be 
a big number resulting in a very low probability. Notably, po is also sensitive 
to the choice of function Z(-). In the asymptotic sense these circumstances are 
irrelevant, since in any case po ~* 1 as — > cxd, if only Z(A^) — > oo. 

We wish to show that in communication range, a particularly defined 'core' 
has a similar role as in the asymptotic graph. In particular, the log log iV scaling 
of distance is roughly valid. We show that the lower bound of expected number of 
nodes that have a link to a core node that has a path through the core hierarchy 
up to the top is large enough and suggests that the log log upper bound is 
valid within the communication range, for the vast majority of nodes. 

Assume fixed iV and take a natural number a; > 0. The probability that a 
node in layer Ux-, has a link to Ux-\ is lower bounded by 'Px- The probability that 
the same node has a path to Ux-2-, through Ux-\, is lower bounded by -pxVx-i- 
And so forth, probability that the same node has a path to the top node 1, going 
through at most x layers, is lower bounded by PxPx-i • ■ 'Pi- Denoting by Cx the 
minimal ratio ^ in relation ([T]), in the corresponding range of j, and denoting 

Po = l-exp(-c,e(3-)'W), (2) 

we find that the probability that a node in layer Ux has a path described above 
is lower bounded by pg. Note that, in asymptotic range, po tends to 1 (very 
slowly), however, within our finite range this is an important parameter affecting 
the quality of bounds. 

Denote by C Ux the nodes in layer Ux having a valid path with upmost 
X hops within the subsequent layers to the top node 1. As a result we have: 

E|f/:I>PSIC^. I (3) 
> NP'^pI \Ux\. 

Denote by Nx the nodes that are in U'^ or have a link to a node in it. 
The probability that a node outside U'^ has link to it is lower bounded by 

1 — exp(— ^^^""^ ) > \ ^l!n ^ ' ^^^^ "^tiyj for the conditional expectation, we 
have: 



2 L 



N 



Therefore, according to 



E(| Nx I) > -^^Pl \Ux\. (4) 

The task is to maximise the bound in a way that x is not too large. 
It appears that asymptotically we end up with the setting that is in line with 
Theorem 12.11 However, it is also possible to find a 'setting' of and x that 
corresponds to asymptotic-like behavior in the communication range. To get 



a qualitative picture, we make simplifications in relation ([J), assuming all con- 
stants, that are close to 1, equal to one. As a result we get an approximate lower 
bound for E(| |) > s{xj): 

s{x,l)fiiN ^-1 (1 - exp(-m))^m 3-^, m = exp((3 - r)Z). (5) 

The maximum is found as solution of equations ^^g^''-* = 0, ^^g^'''' — 0, and by 
comparing values of the function in the closest integer arguments. Although the 
equations are not solvable in closed form, the first one yields the relation: 

ds{x,l) loglogiV /log(l/(l - exp(-m)) 
-^^"^"^ -log(r-2) +^°H -~log(r-2) 

The first term is analogous to the one in Theorem 1 2. 1[ the next one depends on 
the choice of I through m(/), which should be found from the second equation. 
However, we can see the asymptotical regime from these equations. Indeed, we 
see that the relevant factor of s, with respect to argument I, is asymptotically 
exp ( j;^^) ^m-i^ exp [ ioe{r^2}l^p{m) ) ' ^l^ere we took into account 
the equation The second term suggests that m should be an increasing 
function, at least threefold logarithm, and I should be no slower than fourfold 
logarithm. The first term suggests that it should not be too fast, in this respect 
the lowest possible would be the best, and gives the maximum. However, as 
suggested by the Theorem 12.11 the leading term is indifferent with respect to 
this range. Indeed, if we make the corresponding substitution to (O, we see that 
the leading term is just k*. 

If we substitute these asymptotic estimates, as arguments to ([5]), we find 

s{xJ)(xNeM:^iT-2)l{N)), (7) 

which is only slightly lower than N. Thus, the factor exp(— (r — 2)1{N)) has the 
meaning of lower bound expected 'density' of neighbors of the core nodes, with 
valid paths to the top node. This density is almost constant, as a function of 
N, between 1 / log log log and 1/ log log A^. In the communication range, one 
would guess that the best choice would be 1/ log log log A^. Numerical calcula- 
tions seems to support this, see Figured] Our next plot, in Figure [21 indicates 
that in this range the choice of this function has some effect. By taking for 
the X its optimal value x* , we see that the function s{x*,l), see[5l has a visible 
maximum. This numerics showed also that in a very wide range of A^, from 100 
to 10"'^'', a rather constant fraction of nodes, in lower bound expectation, are 
neighbors of core with valid paths to the top. In our case with r = 2.5, this was 
around 20 percent. As a result, a random node is able to find a node in the core 
that has a path to its top, with a moderate sized search: approximately every 
fifth node is of this type. 

This kind of 'quasi-stationarity' or extremely low dependence on A^ should be 
good news, since it simplifies the usability of such models. For instance, s{x,l) 
and I can be taken as constant parameters hardly changing in any reasonable 



- 1, (6) 



Figure 1: Plot of functions, calculated in 
points TV = 10''+\k = 1,2, •■■,9: a;(iV), 

iogiA.-2) log log ^. KN), log log log TV 
and log log log log TV, listed from top to 
down. 



Figure 2: Plot of function s{x*,l), with 
TV — 10'' and with fixed first argument 
with value x* , that it takes at maximum. 



range. We also see that the height of the core is almost constant, and its major 
term is a function of type k* (TV) . One drawback is that we have only lower bound 
type results and some unrigorous estimates were done. However, it is quite likely 
that a thorough analysis will not reveal any substantial new features, although 
it is mandatory to check it. It would be interesting to compare this approach 
with a 'conceptual model' of the Internet, called the 'Jellyfish', [T5] . 
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